Statistical properties of bootstrap estimation of phylogenetic variability from nucleotide sequences. I. Four taxa with a molecular clock.
نویسندگان
چکیده
The statistical properties of sample estimation and bootstrap estimation of phylogenetic variability from a sample of nucleotide sequences are studied by using model trees of three taxa with an outgroup and by assuming a constant rate of nucleotide substitution. The maximum-parsimony method of tree reconstruction is used. An analytic formula is derived for estimating the sequence length that is required if P, the probability of obtaining the true tree from the sampled sequences, is to be equal to or higher than a given value. Bootstrap estimation is formulated as a two-step sampling procedure: (1) sampling of sequences from the evolutionary process and (2) resampling of the original sequence sample. The probability that a bootstrap resampling of an original sequence sample will support the true tree is found to depend on the model tree, the sequence length, and the probability that a randomly chosen nucleotide site is an informative site. When a trifurcating tree is used as the model tree, the probability that one of the three bifurcating trees will appear in > or = 95% of the bootstrap replicates is < 5%, even if the number of bootstrap replicates is only 50; therefore, the probability of accepting an erroneous tree as the true tree is < 5% if that tree appears in > or = 95% of the bootstrap replicates and if more than 50 bootstrap replications are conducted. However, if a particular bifurcating tree is observed in, say, < 75% of the bootstrap replicates, then it cannot be claimed to be better than the trifurcating tree even if > or = 1,000 bootstrap replications are conducted. When a bifurcating tree is used as the model tree, the bootstrap approach tends to overestimate P when the sequences are very short, but it tends to underestimate that probability when the sequences are long. Moreover, simulation results show that, if a tree is accepted as the true tree only if it has appeared in > or = 95% of the bootstrap replicates, then the probability of failing to accept any bifurcating tree can be as large as 58% even when P = 95%, i.e., even when 95% of the samples from the evolutionary process will support the true tree. Thus, if the rate-constancy assumption holds, bootstrapping is a conservative approach for estimating the reliability of an inferred phylogeny for four taxa.
منابع مشابه
Phylogenetic relationships of the commercial marine shrimp family Penaeidae from Persian Gulf
Phylogenetic relationships among all described species (total of 5 taxa) of the shrimp genus Penaeus, were examined with nucleotide sequence data from portions of mitochondrial gene and cytochrome oxidase subunit I (COI). There are twelve commercial shrimp in the Iranian coastal waters. The reconstruction of the evolution phylogeny of these species is crucial in revealing stock identity that ca...
متن کاملPhylogenetic relationships of the commercial marine shrimp family Penaeidae from Persian Gulf
Phylogenetic relationships among all described species (total of 5 taxa) of the shrimp genus Penaeus, were examined with nucleotide sequence data from portions of mitochondrial gene and cytochrome oxidase subunit I (COI). There are twelve commercial shrimp in the Iranian coastal waters. The reconstruction of the evolution phylogeny of these species is crucial in revealing stock identity that ca...
متن کاملMolecular Characterization and Phylogeny Analysis Based on Sequences of Cytochrome Oxidase gene From Hemiscorpius lepturus of Iran
Abstract: Background: Hemiscorpius lepturus is a medically important scorpion found along the Iranian borders, especially near to Khuzestan Province in the south-west of Iran. This is the only non-buthid scorpion which is potentially lethal in southern Iran and is responsible for severe dermonecrotic scorpionism. OBJECTIVES: In this study, DNA fragment of the mitochondrial cytochrome c oxidase ...
متن کاملPHYLOGENETIC RELATIONSHIPS BETWEEN IRANIAN ISOLATES OF MICROSPHAERA AND ERYSIPHE S. LAT. BASED ON rDNA INTERNAL TRANSCRIBED SPACERS SEQUENCES
To study the phylogenetic relationships between Erysiphe s. lat. and Microsphaera, the nucleotide sequences of internal transcribed spacers ofrDNA including 5.8S rDNA gene were determined for 23 taxa. The results showed that Erysiphe. section Erysiphe and Microsphaera are closely related and clustered together with strong bootstrap support (100%). All oftaxa belonging to this group produce coni...
متن کاملMolecular detection and phylogenetic properties of isolated infectious bronchitis viruses from broilers in Ahvaz, southwest Iran, based on partial sequences of spike gene
Infectious bronchitis (IB) is a highly contagious disease involving mostly upper respiratory tract in chickens, leading to significant economic losses in the poultry industry worldwide. One of the major concerns regarding to IB is the emergence of new types of infectious bronchitis viruses (IBVs). The purpose of this study was to identify the IBVs isolated from Iranian broiler chickens with res...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Molecular biology and evolution
دوره 9 6 شماره
صفحات -
تاریخ انتشار 1992